A greedy-navigator approach to navigable city plans 
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We use a set of four theoretical navigability indices for street maps to investigate the shape of 
the resulting street networks, if they are grown by optimizing these indices. The indices compare 
the performance of simulated navigators (having a partial information about the surroundings, like 
humans in many real situations) to the performance of optimally navigating individuals. We show 
that our simple greedy shortcut construction strategy generates the emerging structures that are 
different from real road network, but not inconceivable. The resulting city plans, for all navigation 
indices, share common qualitative properties such as the tendency for triangular blocks to appear, 
while the more quantitative features, such as degree distributions and clustering, are characteristi- 
cally different depending on the type of metrics and routing strategies. We show that it is the type 
of metrics used which determines the overall shapes characterized by structural heterogeneity, but 
the routing schemes contribute to more subtle details of locality, which is more emphasized in case 
of unrestricted connections when the edge crossing is allowed. 



I. INTRODUCTION 

Making cities easy to navigate without maps or elec- 
tronic devices is a desirable objective (although certainly 
not the only one) for urban planning. Based on models of 
how humans find their way in a partially unknown envi- 
ronment, one can evaluate street maps and choose where 
to put new streets to optimize them for better navigabil- 
ity. The setup for such a simulation is to assume every 
pair of start and finish points for a navigator with partial 
information and measure the ratio between the shortest 
path length and the actual path length. In the previous 
work of ours, an index defined by the ratio is within the 
range (0, 1] with representing a worst and 1 a perfect 
navigability of given spatial graph layouts In this 
work, we instead focus on designing optimal transport 
systems for such concept of navigability under limited re- 
sources, which is an important engineering problem and, 
mathematically, a typical example of the constraint opti- 
mization problem [2j where it is well-known that finding 
the exact optimum of such systems is hard. The problem 
gets even more complicated if the measure or object func- 
tion to be optimized is not exactly given. Constructing 
transport networks on which real navigators move pre- 
cisely correspond to this situation, due to the fact that 
the navigability considering real navigators' behavior is 
not given as a simple mathematical description [3, |3rl|| . 

Our idea of modeling real navigators' behavior is essen- 
tially exploiting the spatial information on a local level, 
and thus it can be modeled as a simple greedy rout- 
ing strategy [H, 0, @|. In this respect, our main idea on 
this work is to use the performance of a simple greedy 
routing scheme called greedy spatial navigation (GSN) 
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based on the directional information that we have in- 
troduced in Ref. along with the real shortest path 
navigation (SPN) for comparison, to optimize the inter- 
connected structures for a given distribution of vertices 
on two-dimensional (2D) space under limited resources, 
i.e., the total length of edges (9j-fl3j . In GSN, briefly 
speaking, agents move in a direction as close as possible 
to the direction of the target. If they reach a dead end, 
they backtrack to the previous point where they have an 
untested choice of route. The agents in SPN, on the other 
hand, always follow the optimal route minimizing the to- 
tal path distance connecting the source and the target. 
Instead of solving the mathematically cumbersome non- 
deterministic polynomial time (NP)-complete problem of 
finding the exactly optimal configurations, we focus on 
the realistic approach of constructing shortcuts greedily 
from a spanning tree structure as the skeleton [14j . 

Based on the simulation results from a simple shortcut 
construction scheme starting with randomly distributed 
vertices to be connected, we first find that the type of 
metric (hopping vs. Euclidean distance) is crucial to de- 
termine the final structures. In addition, the type of 
navigability (shortest vs. GSN path length) also plays an 
important role, which is reflected by the different degree 
of inefficiency caused by greedy navigators between those 
cases. The difference between the two navigability mea- 
sures is quite prominent if we allow the crossing among 
edges, which leads to the unrealistic edge condensation 
for the shortest path length but only the fat-tailed degree 
distribution for the GSN path length. 



II. SHORTCUT CONSTRUCTION 

The aim of the model is to construct navigability- 
friendly structures under the constraint of resources given 
by the total length of the edges. This fully deterministic 
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model only depends on the initial configuration of ver- 
tices without any free parameter involved. First, assume 
that we have a set of vertices in a 2D space with their co- 
ordinates given, without any prescribed edges. To guar- 
antee the connectivity and minimum initial resource, the 
minimum spanning tree (MST) T, which is the connected 
subtree minimizing the total length of edges, is generated 
with Kruskal's algorithm This T is the initial state 
of the evolving graph G so that G(t = 0) = T. Wc define 
the navigability as the path length averaged over all the 
pairs of vertices as sources and targets. 

The navigability is first classified as the one using 
GSN pathways and the other using the real shortest 
path (global optimum). Suppose that the network is 
represented as a graph of N vertices at coordinates 
ri, . . . ,rjy = (xi,yi), . . . , (a; at, Un) that are connected by 
M edges. Assume an agent stands at a vertex i and wants 
to travel to t. Let V;j = r 3 - — be the vector between 
vertices i and j and 8j be the angle between v^t and Vjj . 
In case of SPN, an agent simply follows the shortest path 
between i and t using the entire geometric and topologi- 
cal information. An agent of GSN, in contrast, moves to 
the neighbor j of i that has not been visited before and 
has the smallest 0j. If all the neighbors of i have been 
visited the navigator goes back to the vertex from which 
the navigator arrived to i, which is in contrast to the sim- 
ple greedy navigation based on the geometric proximity 
that sometimes fails to reach t due to the lack of such a 
backtracking process [H, || . This procedure is repeated 
until t is reached. 

Furthermore, we here distinguish the metrics based on 
the hopping distance (the number of hops needed to reach 
the target) and the Euclidean distance (the sum of Eu- 
clidean distance along the path). Note that the term 
"Euclidean distance" usually refers to the distance be- 
tween two points in a metric space regardless of their 
direct connection, but in this work we use this termi- 
nology as the sum of Euclidean distance along the con- 
nected pathways, in a more flexible way. The former 
is appropriate for the situation when the time spent on 
the pathway is relatively insensitive to the length of each 
segment of pathways, or the waiting time for each junc- 
tion (vertex in this case) is significant, e.g., the airline 
network or express way network with significant delays 
in the junctions due to the traffic light. On the other 
hand, the latter is more suitable for the case when the 
junction does not play a significant role, e.g., the road 
network without severe traffic and traffic light. Such a 
distinction between vehicular and pedestrian patterns is 
responsible for important structural difference that we 
will discuss in details later. Distinction between GSN 
and SPN representing whether navigators only use the 
directional information or have the ability to fully access 
the real shortest path, combined with those two differ- 
ent metrics, yields four different navigation strategies in 
total. 

At each time step t, the "shortcut" among the vertex 
pair (not already connected by an edge in G(t)), which 



TABLE I: Total edge lengths and routing performances for 
the Boston road example depicted in Fig. [TJ The total edge 
lengths and the GSN/SP distances are shown in an arbitrary 
unit. 



network 


length 


routing performance 


oriefinfil IPiet llTa^l 


1.038 x 10 b 


6.820 (GSN steps) 
4.619 x 10 3 (GSN distance) 

5.716 (SP steps) 
4.259 x 10 3 (SP distance) 


MST [Fig.IJJb)] 


3.818 x 10* 


25.78 (GSN steps) 
1.413 x 10 4 (GSN distance) 

17.02 (SP steps) 
8.173 x 10 3 (SP distance) 


GSNH [Fig. QIc)] 


1.036 x 10 s 


5.266 (GSN steps) 


GSNE [Fig. [Hd)] 


1.029 x 10 b 


4.228 x 10 3 (GSN distance) 


SPNH [Fig. [[Je)] 


1.037 x 10 b 


3.950 (SP steps) 


SPNE [Fig. Hlf )] 


1.029 x 10 :> 


4.103 x 10 3 (SP distance) 



maximizes the performance of one of the four navigation 
strategies: Greedy Spatial Navigation with Hopping dis- 
tance (GSNH) Greedy Spatial Navigation with Eu- 
clidean distance (GSNE), Shortest Path Navigation with 
Hopping distance (SPNH), and Shortest Path Naviga- 
tion with Euclidean distance (SPNE), is selected. The 
shortcut is connected by a new edge, unless the candi- 
date edge crosses one of the existing edges of G(t) in the 
2D space This shortcut construction process is re- 
peated as long as the total length l(t) of edges of G(t) 
does not exceed a length constraint Z max . Figure [T] illus- 
trates the shortcut construction model based on the set of 
vertices in the major thoroughfares of Boston road struc- 
ture [H, [l3|- The constraint Z max , in this case, is given by 
the total length of the original edges in Fig. QJa), and 
the performance is summarized in Table [I] From Fig. [TJ 
we observe that the type of metric [hopping distance for 
(c) and (e) vs. Euclidean distance for (d) and (f)] greatly 
affects the final structure of the network. 



III. RESULTS 

A. Topological properties of emerged network 
structures 

For more systematic approach, we first generate N 
number of vertices on the square with the unit length. 
Then, as described in Sect. [H] the MST is constructed 
and serves as a starting point of the shortcut construc- 
tion. Examples of constructed networks from a vertex 
configuration in the space are illustrated in Fig. [2j All 
the results reported here are from graphs with N = 10 2 
and averaged at least 20 independent graph generations. 
At the first glance, the difference between GSN and SP 
[(a) vs. (c) and (b) vs. (d)] may not be notable enough, 
but the fraction of Braess edges (defined as the edges 
whose removal enhances the greedy navig ability) [l| caus- 
ing the inefficiency, shown in Table HT1 is much smaller for 



FIG. 1: Original Boston road structure (a), MST (b), GSNH (c), GSNE (d), SPNH (e), and SPNE (f). Red edges are the ones 
in MST (b), and black edges correspond to shortcut edges added during the evolving process, for (c)-(f). Note that the edges 
of MST (b) in this case are chosen among the original edges (a). 



TABLE II: Fraction of Braess edges Pb with negative edge 
essentiality e values 0] for emerged networks with different 
greedy shortcut addition schemes, for the model network with 
N — 10 2 and Z max = 20. The e values defined with the 
hopping (Euclidean) distance are used for GSNH and SPNH 
(GSNE and SPNE), respectively. 





GSNH 


GSNE 


SPNH 


SPNE 


Pb 


12.12% 


1.249% 


21.70% 


5.215% 



GSN, from its construction purpose toward the optimized 
structure for the GSN pathways. The comparative time 
series of performance and the total edge length shown 
in Figs. EHBJa) indicate that the GSNH tends to connect 
edges with shorter distance first. 

The most noticeable feature distinguishing between us- 
ing the hopping distance and the Euclidean distance is 
the heterogeneity in the number of connections attached 
to each node, or degree distribution shown in Figs. [3J- 
Eljb). Emergence of hubs (vertices with large degrees) 
for hopping-distance-based scheme is clearly evident as 
shown in much fatter tail from Figs. |3Jb) and|U[b) than 
Figs.JSJb) and^b). Looking more closely, it is also shown 
that even some large hubs located relatively far from the 
centroid, showing the central governance (SPN) vs. de- 
centralized local governance (GSN) hub locality [Figs. [3]- 
EDJc)] . The different importance of geometric distance for 
routing schemes is reflected in the average connection 
probability for vertex pairs with a certain Euclidean dis- 
tance, shown in Figs.lSHBJd), where the connection prob- 
ability is exponentially decreased for GSNE and SPNE, 
while the relatively long-range connections are observed 



for GSNH and SPNH. The average clustering coefficient 
(C(k)} [U| as a function of degrees seem to be increased 
with the degree [Figs. EH^e)], in contrast to topologi- 
cal networks without geometric embedding. In addition, 
naturally, there is a strong correlation between the vertex 
navigator centrality (n(k)) [l[ and degrees [Figs. [3H6jf)]. 



B. Geometric properties: triangular block, angle 
and area distributions 

One can observe lots of characteristic triangular blocks 
for the optimized structures for all the cases, as shown 
in Figs. [T] and [2J Those triangular blocks are quanti- 
tatively counted in comparison to the random counter- 
parts, using the clustering coefficient of a whole net- 
work fl9| - Table Hill shows the results. Based on the 
ratio C/\/C r where Ca is the actual number of triangles 
and C r = 2M /N 2 is the expected number of triangles if 
all the triangles are formed just by chance, obviously the 
Euclidean-distance based strategy produces more trian- 
gular blocks, but for both metrics, it is also notable that 
GSN induces more triangles than SPN. Therefore, along 
with the degree distributions, the triangular block statis- 
tics also support the conclusion that GSN encapsulates 
the refined local structures. In reality, of course, due to 
other factors such as packing the buildings with square 
cross-sections into blocks, the square blocks rather than 
triangular blocks prevail. However, it is important to 
note that such a simple objective function based on nav- 
igability can generate the local block structures. Those 
triangular structures are indeed observed in reality, such 



4 




20 

B 16 
jj 14 

~- 12 



10 h 

8 
6 
4 



.(a) 



'avg. GSN steps 
jal edge length 



10 20 30 40 50 60 70 
edge addition step t 



10 u 



10" 



10" 



10" 



16 A 
□^20 v 



10 u 




CD 

o 
c 
CO 
to 

"O 

> 

03 



0.45 




degree k 



10 15 
degree k 





distance between vertex pair 



10' 

degree k 



c 

v 




10 15 
degree k 



FIG. 3: The topological properties of emerged network structures for several different cutoff values of £ ma x, in case of GSNH, 
where N = 10 2 . Time series of performance and total edge length is shown (a), where the black horizontal lines correspond 
to i max values used in (b)-(f). The others are degree distribution (b), position centrality of vertices as functions of degree (c), 
connection probability of vertex pairs as functions of Euclidean distance (d), average clustering coefficient for vertices with 
given degree (e), and average vertex navigator centrality for vertices with given degree (f), depending on the change of /max 
values. 



as historical centers of cities built by a pedestrian strat- 
egy (2(| [U ■ Such a difference is shown to be even more 
prominent when we remove the no-crossing rule, which 
will be discussed in Sect. IIII CI 

Another geometric aspect of the optimized networks is 
seen by observing the distributions of area enclosed by 
edges and the angles by adjacent edges for vertices, i.e., 



the angles between adjacent roads at the intersections. 
We observe that both the enclosed area distribution and 
angle distribution are notably distinguishable depending 
on the metrics used as shown in Fig. [7] For hopping- 
distance based strategies, more heterogeneous enclosed 
area distributions compared to the Euclidean-distance 
based strategies are observed (Fig. Eta)). In addition, 
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FIG. 4: The topological properties of emerged network structures for several different cutoff values of / ma x, 
of SPNH. 



as in Fig. [3] in case 
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FIG. 5: The topological properties of emerged network structures for several different cutoff values of / ma x, as in Fig. [3j in case 
of GSNE. 



very sharp angles (Fig. [TJb)) are abundant due to the 
existence of hubs (see Fig. [2] as well for examples), and 
the tendency is slightly more significant in SPNH than 
GSNH. On the other hand, the angles are distributed 
around ~ 60°, the characteristic angle of the regular 
triangle for Euclidean-distance based GSNE and SPNE. 
Therefore, the enclosed area and angle distributions also 
indicate the emergence of relatively more regular trian- 



gular structures with uniform enclosed area for GSNE 
and SPNE. 



C. Remarks on the no-crossing rule for edges 

So far, we have not allowed the crossing between edges 
in the construction process since we consider such a 
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FIG. 6: The topological properties of emerged network structures for several different cutoff values of Z ma x, 
of SPNE. 



as in Fig. [3] in case 
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structure for each navig ability (TV = 10 2 and Z max = 20). 



TABLE III: The clustering coefficient based on the number of 
triangles (Ca), compared to the random counterpart (CV = 
2M/N 2 , where TV and M are the numbers of vertices and 
edges, respectively). 



method 


C A 




Ca/Ct 


GSNH 


1.04 x 10" 


i 


3.20 x 10" 


i 


3.26 


GSNE 


1.89 x 10" 


l 


3.66 x 10" 


■i 


5.15 


SPNH 


6.29 x 10" 


a 


2.98 x 10" 


-i 


2.11 


SPNE 


1.56 x 10" 


i 


3.44 x 10" 


■i 


4.53 



crossing as effectively generating a new junction or ver- 
tex. What if there is no such rule? If we allow edge- 
crossing, star-graph-likc structures are naturally emerged 
for SPNH as shown in Fig.JSJc), which leads to the "con- 
densation" in terms of degree, as the bimodal distribution 



for Imax = 12, 16, and 20 shown in Fig. |9][b) [22|. Inter- 
estingly, this severe condensation is not observed in case 
of GSNH, as shown in Figs. |U[a) and H3a) (power-law- 
like fat tailed distribution but no condensation). The 
no-crossing rule, therefore, effectively prevents such a 
condensation for SPNH, while no such explicit rule is 
necessary for GSNH, because the consideration of greedy 
navigators itself naturally avoid such condensed situa- 
tions and allows the local hubs. The combinatorial ef- 
fects of taking both hopping and Euclidean distances are 
discussed in Ref. [23| as well. 

Finally, we remark that our simple model does not take 
into account the realistic number of connected roads for 
junctions (degree in the notion of graphs), which is be- 
tween 2.1 and 2.9 [2(|. Considering such restrictions may 
make the model more realistic, but we would like to em- 
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of /max, where N = 10 2 . 



phasize that our model can be very valid for much wider 
contexts, besides the street networks, without those ex- 
plicit constraints. 



IV. SUMMARY AND DISCUSSIONS 

With a simple greedy optimization under the limited 
sum of length, we have investigated the various struc- 
tural properties of optimized networks for different types 
of navigability. Due to the complexity of finding the ex- 
act optimal configurations of transportation systems, in 
practice, heuristic methods are usually adopted for real 
systems. Therefore, we believe that even though our 
scheme is simple and idealistic, it has some degree of 
implications in real transportation systems. The most 
essential findings of ours are the emergence of hubs in 



case of the consideration of the hopping distance, inher- 
ently geometric aspect of optimized structures reflecting 
the types of metrics used, and the fact that the user-based 
routing scheme (GSN) effectively induces the local hubs 
even without the explicit no-crossing rule. The emer- 
gence of triangular blocks from the simple optimization 
procedure is also notable — would city plans be optimized 
for pedestrian navigability rather than traffic planning 
and building design, then we should probably see more 
triangular blocks. Also, we would like to note that the tri- 
angular like pattern has always been found in the histori- 
cal centers of cities built by a pedestrian strategy [2(], [2l[ . 

More elaborated or realistic approaches, for instance, 
using the different types of backbones, stochastic edge 
additions (similar to the simulated annealing process); 
different objective functions such as the combination of 
navigability and length Q or among different metrics, 
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etc. are good candidates for the future work. The reality funded by MEST R31-2008-000-10029-0 (PH). 
check using human subjects, of course, would be neces- 
sary for the application to the real systems. 
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